IN THE CLAIMS: 



1. A method for generating summaries of a video, comprising the steps of:

inputting summary sentences, visual information and a section-begin 
frame and a section-end frame for each story in a video; 
selecting a type of presentation; 
locating a set of images available for each story; 

auditing the summary sentences to generate a plurality of summary audio 
segments corresponding to an auditory narration of each story; 

composing the set of images to selectively match the set of images with 
the summary sentences to generate a plurality of summary image segments;

matching said audited summary audio segments with the set of
summary image segments to generate a story summary video for each story in
the video; and 

combining each of the generated story summaries to generate a summary 
of the video. 
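
By way of non-limiting illustration only, the matching step of claim 1 might be sketched as follows for an image slide presentation. The sketch assumes that each summary sentence has already been rendered to an audio segment of known duration and that the images chosen for a sentence share that duration equally. The `Slide` type, the function name and the equal-share rule are hypothetical and are not recited in the claims.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Slide:
    image: str       # hypothetical image identifier, e.g. a keyframe file name
    start: float     # seconds from the start of the story summary
    duration: float  # seconds the image stays on screen


def match_images_to_audio(audio_durations: List[float],
                          images_per_sentence: List[List[str]]) -> List[Slide]:
    """Lay out each sentence's images across that sentence's audio segment."""
    if len(audio_durations) != len(images_per_sentence):
        raise ValueError("one image list is required per audio segment")
    slides: List[Slide] = []
    clock = 0.0
    for duration, images in zip(audio_durations, images_per_sentence):
        if images:
            # Assumption: images for one sentence split its duration equally.
            share = duration / len(images)
            for index, image in enumerate(images):
                slides.append(Slide(image, clock + index * share, share))
        clock += duration  # the narration advances even if no image is shown
    return slides
```

Under these assumptions, concatenating the slide lists of successive stories, each offset by the total duration of the stories before it, would correspond to the final combining step.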

2. The method of claim 1, wherein the visual information comprises at 
least one of a shotlist, a keyframelist and a combination thereof. 

3. The method of claim 1, wherein the summary sentences are
generated by: 

generating story boundaries and sentence data using a story extractor; 
selecting a length of a story summary; 

summarizing said sentence data to produce at least one summary 
sentence, wherein a number of the summary sentences produced corresponds to 
the length of the story summary; and 

ordering the at least one summary sentence based on its selection order. 
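
As a minimal, non-limiting sketch of the summarizing and ordering steps of claim 3, the following assumes that each sentence already carries a relevance score and that the highest-scoring sentence is selected first. The scoring itself, the function name and the tuple layout are assumptions made for illustration; the claims do not specify how sentences are scored.

```python
from typing import List, Tuple


def summarize(sentences: List[Tuple[str, float]], length: int) -> List[str]:
    """Return `length` sentences in the order in which they were selected."""
    if length < 1:
        raise ValueError("length must be at least 1")
    # Assumption: higher score means selected earlier. sorted() is stable,
    # so sentences with equal scores keep their original story order.
    ranked = sorted(sentences, key=lambda pair: pair[1], reverse=True)
    return [text for text, _score in ranked[:length]]
```

Here the selected length of the story summary is expressed directly as a number of sentences, so the number of summary sentences produced corresponds to that length.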

4. The method of claim 1, wherein the type of presentation comprises 
an image slide format. 

5. The method of claim 1, wherein the type of presentation comprises
a poster format. 

6. The method of claim 1, wherein the section-begin frame and the 
section-end frame determine a story boundary.

7. The method of claim 1, wherein the step of locating the set of 
images further comprises the steps of: 

collecting a list of images within a story boundary; 

generating a mergelist for clustering images corresponding to each shot
into visually similar groups; 

deleting images belonging to a largest visually 
similar group; and 

sampling a remaining list of images to produce the set of images. 

8. The method of claim 7, wherein the sampling is performed 
uniformly with a sampling interval determined by a number of images desired for 
a given length of story summary. 
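
The locating steps of claim 7, with the uniform sampling of claim 8, might be sketched as below. This is an illustration under stated assumptions rather than a definitive implementation: the mergelist is modeled as a hypothetical mapping from frame number to group identifier, the visual clustering that produces it is taken as given, and the sampling interval is approximated by integer division.

```python
from collections import Counter
from typing import Dict, List


def locate_images(frames: List[int], group_of: Dict[int, int],
                  wanted: int) -> List[int]:
    """Drop the largest visually similar group, then sample uniformly.

    frames   -- keyframe numbers already collected within the story boundary
    group_of -- assumed mergelist: frame number -> visually similar group id
    wanted   -- number of images desired for the chosen summary length
    """
    if wanted < 1:
        raise ValueError("wanted must be at least 1")
    sizes = Counter(group_of[frame] for frame in frames)
    if not sizes:
        return []
    # Ties resolve to the group encountered first in `frames`.
    largest = max(sizes, key=sizes.get)
    remaining = [frame for frame in frames if group_of[frame] != largest]
    if not remaining:
        return []
    # Sampling interval determined by the number of images desired.
    step = max(1, len(remaining) // wanted)
    return remaining[::step][:wanted]
```

For example, with eight keyframes of which four fall in the largest group, requesting two images leaves four candidates and samples every second one.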

9. The method of claim 7, wherein the step of sampling further 
comprises selecting a frame number of each proper noun. 

10. A method for generating summaries of a video, comprising the 
steps of: 

inputting story summary sentences, video information and speaker 
segments for each story in a video; 

locating video clips for each story from said video information; 

capturing audio clips from the video clips, said audio clips corresponding 
to the summary sentences; 

composing the video clips to selectively match the video clips with the 
summary sentences to create a plurality of composed video clips;

combining said corresponding audio clips with the composed video clips to
generate a story summary video for each story in the video; and 

combining each of the generated story summaries to generate a summary 
of the video. 

11. The method of claim 10, wherein the summary sentences are 
generated by: 

generating story boundaries and sentence data using a story extractor; 
selecting a length of a story summary; 

summarizing said sentence data to produce at least one summary 
sentence, wherein a number of the summary sentences produced corresponds to 
the length of the story summary; and 

ordering the at least one summary sentence based on its selection order. 

12. A program storage device readable by a machine, tangibly 
embodying a program of instructions executable by the machine to perform 
method steps for generating summaries of a video, the method steps comprising 
the steps of: 

providing summary sentences, visual information and a section-begin 
frame and a section-end frame for each story in a video; 
selecting a type of presentation; 
locating a set of images available for each story; 

auditing the summary sentences to generate a plurality of summary audio
segments corresponding to an auditory narration of each story; 

composing the set of images to selectively match the set of images with 
the summary sentences to generate a plurality of summary image segments; and 

matching said audited summary audio segments with the set of
summary image segments to generate a story summary video for each story in
the video, wherein a summary of the video is generated by combining each of the 
generated story summaries. 

13. The program storage device of claim 12, wherein the visual 
information comprises at least one of a shotlist, a keyframelist and a combination 
thereof. 

14. The program storage device of claim 12, wherein the instructions 
for generating summary sentences comprise instructions for performing the steps 
of: 

generating story boundaries and sentence data using a story extractor; 
selecting a length of a story summary; 

summarizing said sentence data to produce at least one summary 
sentence, wherein a number of the summary sentences produced corresponds to 
the length of the story summary; and 

ordering the at least one summary sentence based on its selection order. 

15. The program storage device of claim 12, wherein the type of
presentation comprises an image slide format. 

16. The program storage device of claim 12, wherein the type of 
presentation comprises a poster format. 

17. The program storage device of claim 12, wherein the section-begin 
frame and the section-end frame determine a story boundary.

18. The program storage device of claim 12, wherein the step of 
locating the set of images further comprises the steps of: 

collecting a list of images within a story boundary; 

generating a mergelist for clustering images corresponding to each shot 
into visually similar groups; 

deleting images belonging to a largest visually 
similar group; and 

sampling a remaining list of images to produce the set of images. 

19. The program storage device of claim 18, wherein the sampling is 
performed uniformly with a sampling interval determined by a number of images 
desired for a given length of story summary. 

20. The program storage device of claim 18, wherein the step of
sampling further comprises selecting a frame number of each proper noun.